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AESTRACT 

Criterion-referenced testing is designed to determine 
the student's ability to perform a specified skill within a 
comprehensive task. To measure the student's degree of development or 
growth, a teacher-made test is used. This test is used as 
criterion-referenced in the measuring of a student's degree of 
achievement of specified behaviors within a delimited parameter. The 
test becomes normative based when the teacher assigns a numerical 
quantity to each paper for the purpose of grade distribution. With 
criterion-referenced tests, measuring growth can be accomplished 
through either positive or negative behavioral observations. With 
established behavioral objectives, teacher-made tests and classwork 
are excellent for criterion referencing. For the skilled 
psychometrician, normative based testing is a limited sampling of 
behavior that is done according to specific prerequisites. "Norm 
testing focuses upon sorting and selecting students for placement 
into the educational curricula. By doing a detailed analysis of 
errors made on normative testing, the examiner can obtain a more 
detailed picture of the child than by using the actual standard score 
only. Thus, the skilled examiner is better able to relate testing 
observatiion to behavior displayed within the classroom setting, as 
well as viewing each test or subtest as to its component parts of 
incoming stimulus, understanding stimuli or expressive reaction. The 
skilled teacher must also, through observation^ view a given task in 
the form of task analysis. From this, both the examiner and the 
teacher can determine the instructional level for the child and agree 
as to the remedial approach to be used. (DB) 
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"SOME UTILITIES OF CRITERION TESTING" 
by Tim Roberts 

The current trend in education is dealing with the indiv- 
idualized curriculum. This curriculum has been used quite 
successfully in the field of special i2ducation and now has grown 
into the regular classroom. With the impact of individualized 
instruction being felt throughout the nation, special assess- 
ment perplexities have also developed. To adequately assess 
the individualized curriculum, criterion-referenced testing 
has been introduced. 

Cr i ter i on-ref erenced testing is new terminology for old 
concepts. In Ancient China, leaders were chosen by their 
ability to perform at specific tasks that were deemed necessary 
for leadership. Today with the advent of c r 1 ter i on- re f erenced 
testing, children are assessed in relationship to those skills 
that are deemed as necessary to perform a given task or to attain 
success in the educational process. Within the last 70 years 
testing has moved from criterion to the normative based test 
with the pendulum now beginning to swing back. Binet originally 
observed people's behavior and concluded that to experience 
success, specific objectives, skill levels'or cricerions must 
be reached before the next level of development can be attained. 
The observation of large groups developed a norming base upon 
which an individual could be compared. This comparison led to 
the concepts of mental age, grade equivalents and intelligence 
quotient. 



The mental age or grade level concept became a way to 
describe a person's developmental level, which was based on 
specific criterions of development, as compared to the dev- 
elopmental level achieved by the majority of his peers. 
Normative testing is most simply described as the process of 
defining an individual's level of skill attainment and relating 
that attained level to a pre-established group standard. Norm- 
ative testing data is interpreted through converting raw scores 
to standard score equivalents of percentiles, grade equivalent, 
scaled scores, stanines, etc., and not by the degree of con- 
sistency nor hierarchial attainment of skill of the students. 
Gross normative test data results are derived through sampling 
the student's ability to answer questions representative of 
the total range of skills inherent within the specified outcome 
desired. Criterion-referenced testing, on the other hand, is 
designed to determine the student's ability to perform a speci- 
fied skill within a comprehensive task. This involves the 
ability to systematically analyze any given task as to the 
component parts necessary to perform successfully the determined 
task. It further makes necessary the successful completion of 
each hierarchial component before continuing to the next higher 
developmental stage. Therefore, measurement is done by assessing 
the child's ability to consistently perform at a given level 
of qual i ty • 

In every unit that a teacher wishes to teach specific objectives 
are designed as significant parts of the unit. To measure the 



student's degree of development or growth, a teacher-made test 
io used. This test is comprised of specific items that measures 
beha.v-iors deemed essential to the understanding of the unit. 
From this test, a teacher receives insights as to the level of 
knowledge a student has gained as well as insights for the 
student's success potential in the next higher level of develop- 
ment based upon his current knowledge of necessary sub-skills 
within the teaching hierarchial pardigm. The teacher-made test 
is then used as criterion-referenced in the measuring of a student 
degree of achievement of specified behaviors within a delimited 
parameter. This teacher-made test becomes normative based when 
the teacher assigns a numerical quantity to each child's paper 
for the purpose of grade distribution. When qualitative distri- 
butions are assigned from a criterion-referenced test the purpose 
served becomes of as much value as the normative-based test. 
Thus, we as teachers must make a value judgment as to what degree 
or level the child has achieved as well as his proficiency of 
the material learned. By placing a proficiency rate, e.g. 70%, 
on a given skill or sub-skill, we are comparing the child to a 
normative group proficiency and not to his ability to perform on 
the task. For example, related sub-skills can be taught as a 
building process without the attained proficiency rate. If sub- 
skills are delineated, each lower sub-skill will increase in 
proficiency as each higher sub-skill is introduced. By using 
a criterion -referenced test an assessment of a child's growth 
can be measured and evaluated while attempting the next hierarchia 



level of functioning within a specified learning process. 

Criterion-referenced tests should be thought of in terms 
of the educational system; if any skill or task is worth teaching 
it is v/orth measuring for growth. Measuring the growth can be 
accomplished through either positive or negative behavioral 
observations. Positive behavioral observations can be accom- 
plished by establishing behavioral objectives that a"^e measurable 
by successfully meeting each criterion in the objectives. Success 
is measured upon the consistency of growth. Negative observations 
are accomplished by measuring the lessening of error frequency 
within each of the objectives. 

By establishing behavioral objectives, two prominent teaching 
tasks are excellent for criterion-referencing. These tasks are 
teacher-made tests and class work. Classwork may need to be 
developed for assessing the child's educational strengths and 
weaknesses. This may need to be broken down not only in sequential 
learning steps but also those skills that are needed to do the 
task. If the behavior a child is displaying can be delineated 
to minute points, criterion-referenced testing can be of tremendous 
value. 

For the skilled psychometri cian , normative based testing i s- 
a limited sampling of behavior that is done according to specific 
prerequisites. The use of norm testing is focused upon that of 
sorting and selection of students for placement into the educational 



c u r r 1 c u la that have the potential for meeting the child's educa- 
tional needs. Meaningful information must be passed between the 
psy c home tr i c i an and teacher. Terminology such as "Similarities", 
"Block Design", "Visual Sequential Memory", and "Auditory Closure" 
have little meaning to those working with the child if this 
behavior is not noticeable or defined so that all have a common 
reference point in viewing the child. When the child is compared 
to the norming population and to the norm of the test, he is 
forced into a comparative situation which gives little indication 
of specific skill deficiency. Thus, when the child is compared 
to the norm we are doing very little to determine where his 
strengths and limitations are within the learning process and 
what remedial activities should be undertaken. By doing a 
detailed analysis of errors made on normative "^-esting, the 
examiner can get a much more detailed outlook of the child than 
what the actual standard scores relate. With this approach, the 
skilled examiner is better able to relate testing observation 
to behavior displayed within the classroom setting as well as 
viewing each test or subtest as to its component parts of in- 
coming stimulus, understanding stimuli or expressive reaction. 
Thus, the skilled examiner ca\i use a cri ter i on-^ref erenced 
approach for determination of specific objectives lacking in 
development. Through observation of the child the skilled teacher 
must also view a given task in the form of task analysis. From 
this, both the examiner and teacher can determine the instructional 



level for the child and both will be in agreement as to the 
remedial approach used to best profit the child. 

By using a comb i nation of normative based testing and 
cr i tjri on-referenced testing, determination of the child's needs 
can be focused upon. This can be viewed as a check and balance 
situation. If for example it is found 'chat a child displaying 
a particular behavior in for^mal testing, it serves little 
remedirj purpose if it is not observable within the classroom 
or affecting the child's total classroom performance. Thus, 
if synthesizing is not affected, the diminutive behavior may 
not need to be worked with. Through criterion-referenced testing, 
evaluation of the synthesizing can be assessed. If it is found 
that a child has difficulty interpreting the correct word 
meaning within the sequence of directions, criterion-referenced 
testing can show this as transferring into the classroom with 
the development of a checklist that 1s comprised of behavior 
sampled in normative testing situation. 

These two forms of testing need not be used against each 
other. They are a means of bringing the teacher and psyc home tr i s t 
closer in their dealings with a child's need. When used together 
the determination of activities to maximize student learning can 
be made clearer. Neither can nor should stand on their own; 
but, both should be used to compliment the other. 



